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Abstract. We develop a general method for proving that certain star config- 
urations in finite-dimensional normed spaces are Steiner minimal trees. This 
method generalises the results of Lawlor and Morgan (1994) that could only 
be applied to differentiable norms. The generalisation uses the sub-differential 
calculus from convex analysis. We apply this method to two special norms. 
The first norm, occurring in the work of Cieslik, has unit ball the polar of the 
difference body of the n-simplex (in dimension 3 this is the rhombic dodec- 
ahedron). We determine the maximum degree of a given point in a Steiner 
minimal tree in this norm. The proof makes essential use of extremal finite 
set theory. The second norm, occurring in the work of Mark Conger (1989), is 
the sum of the fi-norm and a small multiple of the £2 norm. For the second 
norm we determine the maximum degree of a Steiner point. 



1. Introduction 

There is a vast literature on Steiner minimal trees, mostly in graphs, Hamming 
space and related word spaces, the Euclidean plane, and the Manhattan plane, 
with applications in VLSI design |2()| and phylogenetics [3]. See e.g. the mono- 
graphs 1181 I19| . Cockayne [S] considered Minkowski planes (two-dimensional 
normed spaces). Cieslik initiated the study of Steiner minimal trees in general 
finite-dimensional normed spaces (5J |5] . Recently, other norms besides Euclidean 
and Manhattan have found applications |4l 1221 1231 1241 1251 I26j . while the ge- 
ometry of certain high-dimensional normed spaces is related to the metric spaces 
found in the mathematical study of phylogenetic trees [5]. These trees are also of 
interest in differential geometry; see especially the work of Morgan and his students 

El Ell Q] Q3B EB. 

We denote an n-dimensional normed space or Minkowski space (R", ||-|j) by MP 
(see Section I2.rt|) . A Steiner tree T of a finite set of points N in M" is a tree with 
vertex set V — V(T) and edge set E — E(T) where N C V. The points in TV are 
called nodes or given points, and the points in V \ N Steiner points or auxiliary 
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points. The length of a Steiner tree T is 

£(T):= ]T ||x- V ||. 

xyeE(T) 

Any finite set of points TV* in a Minkowski space has at least one tree of minimum 
length HJ. Such a shortest Steiner tree of N is called a Steiner minimal tree (SMT) 
ofN. 

A star is an SMT such that one of the vertices, called the centre of the star, 
(either a Steiner point or a node) is joined to all the other vertices. The vertex 
figure of a vertex in an SMT is the set of vectors from the centre of the star to 
the other vertices. Note that only the directions of the vectors matter: If a set of 
non-zero vectors is a vertex figure in some SMT, then we may replace each vector 
with another in the same direction, to form a new SMT. In an earlier paper |37| 
we called the problem of describing the vertex figures of nodes and Steiner points 
the local Steiner problem. Here we give a solution of the local Steiner problem in 
Minkowski spaces, formulated in terms of norming functionals (Section 14.2(1 . 

We denote the maximum degree of a given point in an SMT in M™, with the 
maximum taken over all possible SMTs, by d(M n ). Let s(M n ) be the maximum 
degree of a Steiner point in an SMT in M n , where the maximum is taken over 
all possible SMTs in MP. If T is an SMT of N, then T is clearly still an SMT 
of any N' where N C N' C V(T). It follows that s(M") < d(M n ). We use the 
characterization of vertex figures to obtain information on these values for certain 
spaces. 

Cieslik |5j proved that rf(M") is bounded above by the Hadwiger number or 
translative kissing number of the unit ball B(M. n ), i.e., the maximum number of 
mutually non-overlapping translates of B(M n ) that all touch B(M n ). This already 
gives an upper bound depending only on n, since the Hadwiger number is bounded 
above by 3" — 1 |16| . He proved this more generally for all minimal spanning 
trees, a result rediscovered in |30| for the special case of £ p norms. Cieslik [5], 
Conjecture 4.3.6] also made the following conjecture: 

Conjecture 1 (Cieslik O E])- The maximum degree of a given point in an 
SMT in any n-dimensional Minkowski space M™ satisfies 

d{M n ) < 2 n+1 - 2, 

with equality if and only if M™ is isometric to the space which has unit ball 
conv([0,l] n U [-1,0]"). 

The unit ball of is an affine regular hexagon, and of M| an affine rhombic 
dodecahedron. The norm of M| is 

|| (a;(l), . . . , £c(n))|| = max x(i) — min x(i). 

x(i)>0 x(i)<0 

See Fig.^for two affine representations, one as in the conjecture, and the other with 
all faces congruent. In Section l2~4l we represent in a different, more convenient 
way. 

Since any edge joining two points in an SMT can be replaced by a piece-wise 
linear path consisting of segments parallel to the vectors pointing to the extreme 
points of the unit ball, we obtain the following well-known lemma. 

Lemma 2. If the unit ball o/M" is a polytope with v vertices, then d(M n ) < v. 
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Figure 1 . Two representations of the rhombic dodecahedron 

It follows that d{M n z ) < 2" +1 - 2. Cieslik |6j proved the case n = 2 of his 
conjecture. However, we show that d(M. z ) < 2™ +1 — 2 for all n > 3, thus partially 
disproving the conjecture. It is not difficult to see that d(M z ) < 14 by shortening 
the star connecting the origin to the vertices of the rhombic dodecahedron. If there 
exists an SMT with a given point of degree 14, then the star joining o to the vertices 
of the unit ball will be an SMT of the 14 vertices together with the origin. In Fig. [3 
on the left we have three of the fourteen edges from the origin to the vertices. These 
three vertices, one of degree 4 and two of degree 3, are on the same facet of the unit 
ball. The length of this subtree is 3. On the right we show that it can be shortened 
to a length of 5/2 by introducing two Steiner points. This sub-configuration is 
therefore forbidden in any star joining the origin to a subset of the vertices, and it 
easily follows that d(M z ) < 10. This turns out to be the correct value (Fig. 0. 

Theorem 3. For any n > 3, d(M|) = (, (n +t?/ 2 j)- 

The proof is in Sectional We also describe all possible vertex figures of a node 
in an SMT in M^. From the proof we obtain the following corollary. 

Corollary 4. In the Minkowski space M 7 ^, the star joining o to any set A of 
non-zero points is an SMT of AU {o} if and only the star joining o to any subset 
B of A of size at most 4 is an SMT of B U {o}. 

For all n > 2 there is, up to isometries, a unique configuration attaining the 
maximum degree fTheoreml23lin Section |5j. These spaces give the largest known 




Figure 2. Shortening a tree in the rhombic dodecahedral norm 
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Figure 3. A given point of degree 10 in the rhombic dodecahedral norm 

degrees of SMTs in Minkowski spaces of dimensions 2 to 6, in fact d(M|) = 10, 
d(M%) = 20, d(M|) = 35, and d(M%) = 70. For n > 7, d(i^) = 2™ is larger. It is 
not at all clear whether M| maximises d(M. n ). 

Morgan [28j p. 42], (29] p. 129] asks the following question: 

Conjecture 5 (Morgan gHHU). The maximum degree of a Steiner point in 
an SMT in any n-dimensional Minkowski space MP satisfies 

s(M n ) < 2". 

The asymptotically best known upper bound for both conjectures is 38 

s(M n ) < d(M") < c2"n 2 logn. 

It is known that s(M 2 ) < 4 for all planes |37| . There are many two-dimensional 
spaces attaining s(M 2 ) = 4. Some piece- wise C°°, elliptic examples were discovered 
in [J. They are characterised in |37j . 

We give a lower bound for s(M^) that is asymptotically correct up to a factor 
of 2 by Theorem El 

Theorem 6. For any n > 3, s(M|) > (i (n ™t)/2j)- Therefore, s(M£) = 
0(27v^)- ~ " 

The proof is in Sectional 

The sharp upper bound for differentiable norms is s(M n ) < <i(M") < n + 1 
[HI ESI- It is well-known that s(^) = ) = 2™ and s(Q) = d(£^) = 2n 
(see also Section^. For the £ p norm, 1 < p < oo, we have 3 < s(^™) < rf(£") < 7 if 
p > 2, n > 2, and min{n, (p _f )ln2 } < s{£™) < d(^) < 2p/(p~ 1 ) if 1< p < 2, n > 3; 
see |36| . where more detailed estimates are obtained. 

Conger [TU] showed that s(R 3 , |M|i + A||-|| 2 ) > 6 for all < A < 1. These norms 
are piece-wise C°° and elliptic. In pQ it is shown that s(R 2 , ||-||i + A||-||2) = 4 for 
all < A < 2 + \[2. The value A = 2 + V2 is sharp, since it follows from the results 
in that s(M 2 , ||-||i + A||-|| 2 ) = 3 for all A > 2 + y/2. We generalise these results 
as follows. 

Theorem 7. Let A > 0, and let M" = (M n , ||-||i + A||-|| 2 ). If X < 1 then 
s(M n ) < d(M n ) < 2n. IfX< y/n/(Vn- 1) then d{M n ) > s(M") > 2n. 
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The proof is in Section In this regard Conger made the following conjecture 
[29l p. 128]. 

Conjecture 8 (Conger). For any piece-wise differentiable, elliptic M™ we 
have s(M") < 2n. 

Our results are based on a characterization of the vertex figures of nodes and of 
Steiner points in SMTs in arbitrary Minkowski spaces (Section 0J. This character- 
ization is found using the subdifferential calculus (Sectional). The characterization 
of vertex figures of nodes enables us to reduce the determination of d(M^) to a 
purely combinatorial problem in extremal finite set theory (Section [SJ. The char- 
acterization of vertex figures of Steiner points in the case of is not so easily 
reducible to combinatorics, hence the partial results of Theorem|Hl(Section|HJ). This 
situation can be compared to |37) . where the characterization of nodes in an arbi- 
trary two-dimensional Minkowski space is much simpler than the characterization 
of Steiner points. However, the norm ||-||i + A||-||2 is sufficiently simple so that the 
maximum degree of a Steiner point can be determined (Section 0. 

In the next section we collect basic definitions used in the paper. 

2. Basic definitions 

2.1. Signed sets. Let [n] := {1, 2, . . . , n}, and let "P[n] be the set of all subsets 
of [n]. Let \A\ denote the number of elements in the finite set A. We use the 
following standard notation for signed sets (see [3]). A signed subset X of [n] 
is a vector X 6 {0, ±1}". We denote the ith component of X by X(i) (reserving 
subscripts such as Xi for indices). The positive part of X is X + := {i : X{i) = +1}, 
the negative part of X is X~ := {i : X(i) = —1}, the support of X is X_ := X + UX~ , 
and the zero set of X is X° := [n] \ X_. We also write X = (X + , X~). Thus any 
two disjoint subsets A, B of [n] determine a signed set (A, B). The natural partial 
ordering on signed sets X < Y is defined by X + C Y + and X~ C Y~ . Two signed 
sets X, Y are conformal if X + n Y~ = = X~ n Y + . Denote the empty signed set 
(0,0) by 0. 

2.2. Vector spaces. We consider K™ to be the vector space of column vectors 
x = (#(1), . . . , x(n)) T , with the standard basis e 1; . . . , e„ defined by e.j(j) = 5y. 
The support of a vector x € W 1 is the signed set supp(:c) := (supp + (a;), supp _ (ai)), 
where supp + (cc) = {i : x(i) > 0} and supp~(a;) = {i : x(i) < 0}. Let x + ,x~ be 
defined by x + (i) = max{a;(i),0} and x~{i) — min{:c(i), 0}. 

Let V C R™ be a subspace, and it : W l — > V a projection. Define supp(y) := 
{supp(a;) : x £ V}. This is the set of covectors of the oriented matroid of the vector 
configuration {n(ei) : i = 1, . . . , n} in V; cf. [5[ §2.2]. 

The dual V* of a vector space V is the set of all linear functionals ip : V — ► K. 
We identify (E™)* with the set of row vectors 99 = (<p(l), ■ ■ ■ , f{n)), with standard 
basis €i — ej . Let V be a subspace of R™, and let it : R™ — > V be an orthogonal 
projection. The quotient space M ra /ker7r can then be identified with V. The dual 
of R n /ker7r is in a canonical way isomorphic to the annihilator of ker7r, i.e., the 
subspace {(p : kern C ker<^} of (R™)*. We accordingly identify this space with V* . 
It is easily seen that with these identifications supp(V) = supp(y*) (as long as 7r 
is orthogonal). 

For any two vector spaces V and W, (V x W)* is canonically isomorphic to 
V* x W*, since a functional <p G (VxW)* may be decomposed uniquely as <p(x, y) = 
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ijj{x) + x{y) f° r some ip £ V* and x£ V*; in fact, ip(x) — ip(x,o) and xiu) = 
(p(o,y). Conversely, given any i/j £ V*,\ G W* , their sum ^>(cc) + xiu) defines a 
functional on V x W which we denote (ip,x)- 

If A and B are subsets of a vector space V, we define their Minkowski sum to 
be A + B := {a + b : a £ A, b £ B}. 

2.3. Minkowski spaces. A Minkowski space is a finite-dimensional vector 
space V together with a norm ||-|| on V, denoted by M = (V, ||-||), or M™ to indicate 
that it is n-dimensional. The unit ball B — B(M) = {x £ V : \\x\\ < 1} determines 
the norm uniquely, and we also write M = (V, B). 

The dual of M = (V, ||-||) is M* = (V\ ||-||*), with the dual norm defined by 

:= m&x{ip(x) : \\x\\ = 1}. 

Denote the dual unit ball by B* . It is well-known that the dual of the dual is again 
the original space. 

The £ p -norm on R" is defined by 

n 

||(x(l),...,x(n))|| p := El*Wn Vp 

i=l 

if p > 1, and 

||(aj(l), • • ■ ,x(n))||oo := niax{cc(i) : i £ [n]}. 
The space PI is (K n , ||-|| p ) = (K",-B™). The n-cube is B^, and the n- dimensional 
cross polytope is -B™. The spaces and are dual. 

By the (Hahn-Banach) separation theorem, each non-zero x £ M n has a norm- 
ing functional, i.e. a functional ip such that \\<p\\* = 1 and if{x) = \\x\\. We denote 
the set of norming functionals of a non-zero x by dx. Each dx is an exposed 
face of B* , i.e., the intersection of B* by some supporting hyperplane, in fact 
dx = B* n {</? : y(£c) = ||£g||}. A norm is differentiable if and only if dx is a 
singleton for each x ^ o (and then the norm is C 1 ). 

A norm is strictly convex if for all linearly independent x,y £ M™ we have 
|| sb + y|| < ||cc|| + ||y||. This is equivalent to there being no straight segment in the 
boundary of the unit ball (i.e., strict convexity of the unit ball). A norm is strictly 
convex if and only if the dual norm is differentiable. 

A norm ||-|| on V is elliptic if ||-|| — ||-||2 is still a norm for some Euclidean norm 
||-||2 (i.e. for any identification of V with R"). We also say that the unit ball of 
an elliptic norm is elliptic. If the norm is C 2 , this is equivalent to requiring that 
the boundary of the unit ball has positive inward curvature bounded away from 0, 
where the curvature is defined by any Euclidean structure on the space. A norm is 
elliptic if and only if its dual is C 1 ' . See |27l §3] for more on elliptic norms. 

Note that in functional analysis a differentiable norm is also called a smooth 
norm, while in differential geometry the word smooth is usually reserved for C°° . In 
differential geometry the unit ball of an elliptic norm is also called uniformly convex, 
which again has another meaning in Banach space geometry. We therefore avoid 
the terms smooth and uniformly convex in this paper, and stick to the terminology 
of the previous paragraph. 

If V is a subspace of M = (W, ||-||), then the quotient space M./V is the space 
W/V with norm ||as + V|| := inf{||a; + v\\ : v £ V}. If V is a subspace of M = 
(R™, ||-||), a concrete presentation of M/V can be obtained by using the orthogonal 
projection tt : R™ — > V . Then M/V is isometrically isomorphic to (V ,ir(B)), 



THE LOCAL STEINER. PROBLEM IN NORMED SPACES 



7 



i.e., we project the unit ball B orthogonally onto V 1 - to obtain the norm. The dual 
(M/V)* is as before the annihilator of ker7r = V, and the dual norm is a restriction 
of the dual norm ||-||* of M*, i.e., the dual is a subspace of M*, with the embedding 
given by the adjoint 7r* of the projection n. 

2.4. Zonotopes as unit balls. Denote the closed segment from a to b by 
[a,b]. A zonotope is a Minkowski sum of closed segments, equivalently, a zonotope 
is the projection of an n-cube onto a subspace. The zonotope determined by a 
subspace V of R n is the orthogonal projection Z = 7r(_B^,) of the n-cube onto V. 
As explained in Section 12731 (V, Z) is isometric to the quotient ^/ker7r, and its 
dual (V*,Z*) is isometric to the subspace im.7r* of l\. As before, we identify V* 
with this subspace. Then Z* — V* fl B". We need a description of the faces of 
Z* . Each proper face of Z* is the intersection of V* with a proper face of -B™, i.e., 
there is one for each signed set A E supp(V*), namely 

F'(X) := {ip eV* : = l.suppfoO < A}, 

of dimension |A| — 1. Since X i— > F'(X) is order-preserving, we have that F'(X) is 
a facet if and only if A is a maximal covector in supp(V*). Two faces F'(X) and 
F'(Y) belong to a common facet if and only if A and Y are conformal. 

We mention that the corresponding non-empty faces of Z are indexed by the 
same set supp(T^) = supp(V*). Each such face equals some 

i£X+ jex- kex° 

of dimension \X°\ (see p|l §2.2]). The mapping A i— > F(X) is order-reversing, and 
so F{X) is a vertex if and only if A is a maximal covector in supp(V™), while two 
faces F(X) and F(Y) intersect if and only if A and Y are conformal. 

We now consider the Minkowski space defined in the introduction. Let 7r 
be the orthogonal projection of K n+1 onto the hyperplanc 

n+l 

H :={xe K n+1 : £sc(i) = 0}. 
i=i 

This projection is along the vector j := (1,1,..., 1) T . Thus is the quotient 
i^ 1 1 ker7r, which we identify with the n-dimensional space Fl with unit ball Z n := 
tt(B 7 c ^ 1 ). It is easily seen that the norm is given by 

\\ x \\z : = h{ max x(i) — min x(i)). 

ie[n+l] ie[n+l] 

The dual space Mg, := (M|)* is the subspace 

n+l 

H* :={^e(M" +1 )*:^^e,) = 0} 

i=l 

of £i +1 , with support 

Z n := supp(iT) = {X e {0, ±1}" +1 : A+ ^ 0, A" ^ 0} U {0}. 

Note that F'(0) = 0, and all other F'(X) are non-empty proper faces of Z*. The 
facets of Z* are all the F'(X) with X e {±1}" +1 , X ± ^ 0. In general, 

F'{X) = { v e (R n+1 r : E ^) = h E ^ = ~h su pp(^) < 

i&x+ iex- 
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Also define Z n :— Z n U {1}, with 1 a symbol corresponding to the improper face 
Z*. We consider 1 > X for all X £ Z n . Thus (Z^, <) is the face lattice of Z*. 
It is easy to sec that Z* is the difference body of the regular n-simplex 

A„ :— convjei : i 6 [rt + 1]}. 

Thus Z| is a regular hexagon, and Z£ a cuboctahedron. 

To show that Z n is affinely equivalent to conv([0, l] n U [—1,0]"), let it' be the 
(non-orthogonal) projection of K™ +1 onto the hyperplane 

H' = {x£ R n+1 : x(n + 1) = 0}, 

again along the vector j. Then 7r'|# and tt\h' are inverses of each other taking Z n 
to 7r'(C" +1 ) and vice versa. Identifying H' with M" in the obvious way, it is easily 
seen that 7r'(C n+1 ) = conv([0, 2]™ U [-2, 0]™). 

3. Subdifferential calculus 

In this section we summarise the basic results of the subdifferential calculus 
needed in this paper. For more on subdifferential calculus, see |31j and |40| . We 
only consider convex functions defined everywhere in finite-dimensional spaces, so 
many of the arguments simplify. Although these results are all well-known in convex 
analysis, they are not easy to locate in the same reference, and the proofs given are 
usually based on a variety of other more general theorems. For the convenience of 
the reader we give direct proofs of these lemmas. We use the following version of 
the (weak) separation theorem (311 Theorem 11.3]. 

Separation Theorem. Let C\ and Ci be non-empty convex sets with disjoint 
relative interiors in a finite- dimensional real vector space. Then there exists a 
hyperplane H such that C\ and C2 are in opposite closed half spaces bounded by H. 

Let V be any Minkowski space with dual V*. As usual, a function / : V — ► K 
is convex if f(Xx+ (1 - X)y) < Xf(x) + (1-X)f(y) for all x, y € V and < A < 1. 
Convex functions on V are continuous on V. A functional (p £ V* is a subgradient 
of a convex function / at the point a if for all x S V, 

f(x) - f(a) > ip(x - a). 

In particular, o € V* is a subgradient of / at a if and only if / attains its minimum 
value at a. The subdifferential of / at a, denoted by df(a) 1 is the set of all 
subgradients of / at a. Note that for any ip G V* and eel, d(p + c)(a) = {ip} 
for all a G V. 

Lemma 9. The subdifferential of a convex function f : V — > R at any a £ V is 
non-empty, compact, and convex. 

See pHl Theorem 24.7]. 

PROOF. Since df(a) = C\ x(iV {p> € V* : tp(x - a) < f(x) - /(a)}, which 
is an intersection of closed half spaces in V*, we have that df(a) is closed and 
convex. If df(a) is unbounded, a compactness argument shows that it contains a 
ray {<po + Xipi : A > 0}. Therefore, f(x) — f(a) > po(x — a) + Xtpi(x — a) for all 
x 6 V and A > 0. It follows that pi(x — a) < for all x £ V, a contradiction. 

To show that df(a) 7^ we use the separation theorem to obtain a hyperplane 
that separates the point (a, /(a)) and the epigraph of / 

{(x,y):x£V,y>f(x)}CVxR. 
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Thus there exists a non-zero functional (f,r) £ (V x R)* = V* x E such that 
((p,r)(x,y) > ((p,r)(a, f(a)) for all a; 6 V and y > f(x). Then <^(cc — a) > 
r(/(a) — y), which gives that r > 0, since y can be arbitrarily large. If r = this 
also gives if — o, contradicting [ip, r) ^ (o, 0). Therefore, r > 0, and — ^<p(x — a) < 
f(x) - f(a) for all x £ V, giving <9/(a). □ 

Lemma 10 (Moreau-Rockafcllar Theorem). Le£ /; : V — ► R } i = 1, . . . , k be 
convex functions. Then for all a £ V , 

k k 

0(£ /0(a) =$>/,(£»), 

i=l i=l 

where the sum on the right is Minkowski addition. 
See jSTJ Theorem 23.8] and @D1 Theorem 2.8.7]. 

Proof. The "^''-inclusion is straightforward, but the "C"-inclusion needs the 
separation theorem (see the second proof on p. 224 of |31j1. We prove the case 
k = 2, with the general case following by induction. 

Let / and g be convex functions on V and let ip £ d(f + g)(a). Then 

f(x) - f(a) + g(x) - g(a) > tp(x - a) for all x £ V. (1) 

The sets 

Cx = {(x, r) € V X M : f(x) - f(a) - <p(x - a) < r} 

and 

C 2 — {(x,r) £ V x R : g(x) - g(a) < -r) 
are both convex and closed with non-empty interior, and by Q their interiors are 
disjoint. They have a common point (a, 0). By the separation theorem there exists 
a non-zero (tp,s) £ (V x R)* = V* x M such that (tf>,s)(x,r) > (i/>,s)(a,0) for all 
(x 7 r) £ C\ and (ip, s)(x,r) < (tjj,s)(a,Q) for all (x,r) £ Ci- This gives that for all 

X £ V, 

tp(x — a) > —sr for all r > f(x) — f(a) — tp(x — a), (2) 

and 

ip{x — a) < —sr for all r < g(a) — g(x). 

Since r can be arbitrarily large in (J2J, s > 0. If s = 0, then ip(x — a) = for all 
x £ V, contradicting (ip,s) ^ (o, 0). Therefore, s > 0, and we obtain 

-~ip{x -a)< }{x) - f(a) - >p{x - a) 

and 

\tp(x -a)< g(x) -g(a). 

This gives (p—\ip£ df{a) and ^tfj £ dg(a). Adding, we obtain if £ df(a) + dg(a), 
as required. □ 

Lemma 11. Let fi : Vi — ► R } i = 1, . . . , k be convex, and define f : V% x • • • x 
14 — ► R 6y f(xi, . . . , Xf.) = ^2ii—ifi{ x ij- Then the subdifferential of the convex 
function f is the Cartesian product 

k 

a/(ai, . , o fc ) = J] d M a i) C V x * x ■ • • x 14*. 
See @ni Corollary 2.4.5]. 
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Proof. By induction it is sufficient to consider the case k = 2. Let / : V — > K 
and 5 : — ► K be convex, define F : V x W — > E by F(x, y) = /(a;) + g(y), and 
choose a E V, b e W. 

First choose </? S df(a) and ^ e dg(b). Then /(x) — /(a) > </>(x — a) for all 
a G V, and g(y) — g(b) > ip(y — b) for all b E W. Adding, we obtain F(x,y) — 
F{a, b) > (tp, ip)(x, y) - (tp, ip)(a, b), which gives (tp, tp) S 8F(a, b). 

Now choose (tp, ip) € dF(a, b). This means that for all x E V and y G W, 

tp(x -a) + 4,{y -b)< /(x) - /(a) + 5 (y) - g{b). 
Setting y — b we obtain e df(a), and setting X = a gives S dg(b). □ 

We have already defined the set of norming functionals dx of a non-zero a; e M. 
We now extend this notation by defining <9o = B(M*). The reason is the following: 

Lemma 12. T7ie subdifferential of the norm o/M at a is given by d\\a\\ = da. 

See gOl Corollary 2.4.16]. 

PROOF. By definition, tp e B(M*) if and only if tp(x) < \\x\\ for all x e M, 
which is equivalent to e <9||o||. This takes care of the case a = o. 
If a ^ o, then for any y> 6 9a and any x 6 M, 

9?(x - a) = (/j(x) - || a. || < ||x|| - ||a||, 

hence ip £ 9||a||. Conversely, suppose 

||x|| - ||a|| > ip(x - a) for all x € M. (3) 

Setting x = o in |[SJ we obtain (p(a) > \\a\\, giving \\tp\\* > 1. Setting x = 2a in 
(PJ we obtain ||a|| > <p(a), giving tp(a) = \\a\\. Now © gives ||a;|| > tp(x) for all 
x e M, implying ||<p||* = 1. It follows that tp is a norming functional of a. □ 

The distance function p(x, y) := ||x — y|| is easily seen to be convex on M x M 
by the triangle inequality. 

Lemma 13. For any a, b e M, 

dp(a, b) = {(ip, -tp):tp€ d(a - b)} C M* x M*. 

Proof. Let tp € <9(a — 6). By Lemma IT2"1 we have for any x, y £ M, 

11^ — 2/11 — ll a — ^11 > vK 33 — y ~ a + &) 

= tp(x - a) - <p(y - b) = (tp,-tp)(x - a,y - b), 

which gives (tp, — tp) E dp(a,b). 

Conversely, let (tp, tp) G dp(a, b). Then, in particular, for any x G M, 

p(x, x) — p(a, b) > tp(x — a) + tp(x — b), 

i.e., (tp + ip)(x) < tp(a) + ip(b) — \\a — b\\. Since the right-hand side is independent 
of x, we obtain tp + tp = o. □ 

4. The local Steiner problem in Minkowski spaces 

Our characterization of vertex figures is in terms of so-called reduced Minkowski 
addition defined on subsets of the dual of a Minkowski space. 
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4.1. Reduced Minkowski addition. Wc define the reduced Minkowski sum 
of two closed, convex subsets C and D of the unit ball of (M™)* to be 

CmD = {Lp + i> : 99 € C,ip€ D, \\<p + 4>\\* < 1}, 

i.e. C ffl D is the usual Minkowski sum C + D intersected by the unit ball B* of 
(M™)*. Reduced Minkowski addition is clearly commutative but not associative, 
although it satisfies a weak associative law (see Proposition 1141 below). We now 
consider some elementary properties of this binary operation. Consider a finite 
non-empty family E = {Ai : i G /} of operands, where / C [k] and each Ai is a 
closed convex subset of B* . We call / the support of E. A parenthesization of E is 
defined to be a parenthesization, in the usual sense, of some ordering 

A J{1) ffl . . . ffl A m (4) 

of E, where j : [|/|] — > / is some bijection. Stanley calls this a binary set bracketing 
of E [35 p. 178], and its enumeration is called Schroder's third problem 33]. We 
denote a parenthesization of E by (E), and we also call the family I the support of 
(E). We may define a parenthesization of E recursively as follows. 

• If E is the singleton {A}, then (E) = Ai is the only parenthesization of 
E. 

• If E = Ei U E2, with Ei and E2 disjoint, and (E^) is a parenthesization of 
E, for each i = 1,2, then both (E) = ((Ei)ffl(E 2 )) and (E) = ((E 2 )ffl(Ei)) 
are equivalent parenthesizations of E. 

More generally, two parenthesizations of E are equivalent if they can be transformed 
into each other using the commutative law on any subexpression. This is an equiv- 
alence relation, and two equivalent parenthesizations clearly evaluate to the same 
set. The collection of equivalence classes of parenthesizations of a set E with sup- 
port I corresponds bijectively with the collection of abstract trees with |/| + 1 leaves 
labelled by the elements of {0} U /, and with |/| — 1 internal vertices of degree 3. 
We call such a tree a rooted abstract Steiner tree on /. The node is the root of the 
tree, node i corresponds to Ai for each ieJ, and each internal vertex corresponds 
to an instance of ffl in the parenthesization corresponding to the tree. Note that 
we do not distinguish between left and right branches (as is done in the well-known 
bijection between parenthesizations of ordered expressions and planted, trivalent 
plane trees, both counted by the Catalan numbers; see |35l Exercise 6.19(b),(f)]). 
Stanley calls this a binary total partition [351 Example 5.2.6] of the set /. The 
explicit construction of the bijection for our variant is the following. We denote the 
rooted abstract Steiner tree associated to (E) by ?b(E). 

• If (E) = Ai then To(S) is the tree joining and i. 

• If (E) = ((Ei) ffl (E 2 )) then T (E) is the tree obtained by identifying the 
vertices in To(Ei) and To(E 2 ) to a single vertex x (corresponding to the 
instance of ffl operating on (Ei) and (E 2 )), and joining x to a new root 0. 

It is clear that two parenthesizations of E are equivalent if and only if their associ- 
ated rooted abstract Steiner trees are equal. 

The solution to Schroder's third problem, i.e., the number a k of equivalence 
classes of parenthesizations with support [k], is the product of the first k — 1 odd 
numbers: 

fc-i 

a k = JJ(2i-l). 

1=1 
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This is seen as follows. Since a,k equals the number of rooted abstract Steiner trees 
on [k], i.e., trees with k + 1 leaves {0, 1, . . . , k} and k — 1 internal vertices of degree 
3, we have a\ = 1 and au+i — (2fc — l)ofc, since we may subdivide any of the 2k — 1 
edges of such a tree on [k] and join the new vertex to k + 1, to obtain such a tree 
on [fc + 1]. 

We remark that the fcth Catalan number can be derived from at- Since there 
are k—1 EE's in a parenthesization of {Ai : i G [As]}, there are 2 k ~ 1 parenthesizations 
in an equivalence class. This gives 2 fc ~~ 1 a/ £ parenthesizations (in the usual sense) 
of the ordered expression A^m EH . . . A^^), where ir is a permutation of [fc], taken 
over all permutations ir. Therefore, the number of parenthesizations of A% EH ... EH 
Ak is 2 fe ~ 1 afc/fc!, which equals the kth Catalan number. This derivation of the 
Catalan numbers is essentially the same as the classical combinatorial derivation of 
Rodrigues [32] . 

We define an abstract Steiner tree on / to be a tree with set of leaves /, and 
with |/| — 2 internal vertices of degree 3. The abstract Steiner tree associated to 
a rooted abstract Steiner tree on / is obtained by contracting the root and its 
incident edge. The abstract Steiner tree of a parenthesization (£) obtained in this 
way from Tq(£) is denoted by T(£). The number of abstract Steiner trees on [k] 
clearly equals a^-\. 

We call two parenthesizations weakly equivalent if they can be transformed into 
each other using the commutative law on any subexpression or the associative law 
on the whole expression, i.e., a bracketed expression 

(((Ex) EB <£ 2 » EB <£ 3 » 

may be transformed into 

«£ 1 >EH«£ 2 EH£ 3 »), 

and vice versa. It is again clear that two parenthesizations £ and £' are weakly 
equivalent if and only if their abstract Steiner trees are equal: T(£) = T(£'). 
It follows that the number of weak equivalence classes of parenthesizations on [k] 
equals ak—i- The operation EH has the following weak associativity property. 

Proposition 14. Let (£)i and (£) 2 be two weakly equivalent parenthesizations 
of £ = {Ai : i G /}, where each Ai is a closed convex subset of B(M*). Then 
o G (£)i if and only if o € (£) 2 - 

Proof. By the definition of weak equivalence, it is sufficient to show for any 
three A % C B(M*), i = 1, 2, 3, that o G A x EH (A 2 EH A 3 ) if and only if o G (A 1 EH 
A2) EH A3. However, since A, C B(M*), both statements are equivalent to o G 

4.2. The characterization. 

Theorem 15 (Nodes). Let N = {po,Pi, ■ ■ ■ ,Pk} be a set of points in a Min- 
kowski space M™. Then the star joining po to each pi, i £ [k], is an SMT of N if 
and only if (£) ^ for each parenthesization (£) of £ = {d(pi — po) : i € [As]}. 

Proof. =>: Consider any parenthesization (£) of £ and its associated rooted 
abstract Steiner tree Tb(£). We now turn this tree into a Steiner tree in M™. 
Associate leaf i with pi for each i e {0} U [k], and associate each of the k — 1 
internal vertices with a variable point Xi G M™, i G [As — 1], Denote this Steiner 
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tree by To(E)(cci, . . . , Xfe-i). Its length is 

L(x 1 , . . .,aj k _i) := ^(T (E)(asi, . . .,x fe _i)). 

The Steiner points Xi in Tq(E)(xi, . . . ,Xk-i) may coincide — this results in the 
tree being in fact a contraction of T (E). Note that L : M" x • • • x M" — > R is a 
convex function since 

L(xi, . . . ,x k -i) = ^ Pe(xt,...,x k -i), 

e6_E(T <£}) 

where p e {xi, ■ ■ ■ , Xk-i) = \\a—b\\, with a and b the two points in iVU{a;i, . . . , Xk-i} 
associated to the vertices incident to e. Each p e depends on only one or two of the 
variables a^. Since To(E)(po, . . . ,po) is the star joining po to all pi, i G [k], which 
is an SMT by assumption, L attains its minimum at (po, . . . , po). Thus 

o€ dL(p , ...,p ) = dp e (p ,...,po) (5) 

eeB(T (S)) 

by Lemma ITU1 By Lemmas II II 1121 and 1131 if e = XiXj, then 

dp e {Po, ■ ■ ■ ,Po) = {(o, . . . ,o, (p ,o, ... ,o, n^,o, ...,o):ipe B*} 

pos. i pos. j 

while if e = piXj, then 

dp e (po, . . . ,p ) = {(o, . . .,o,jp^,o, ...,o):ipe d(pi -p )}. 

pos. j 

By considering each coordinate i G [k— 1] of JSJ) we obtain a functional <p e G (M n )* 
for each edge e G E(T (T,)) such that 



B* if e = aj^ajj or e = po^i, 

- po) if e = p 4 :r,, i 7^ 0, 
• and for each Steiner point Xi, ip e = iff + ip g , where e is the incoming edge 
and /, g the two outgoing edges of Xi, when the tree is directed away from 
the root po. 

By induction on the definition of Ib(£) (= induction on subexpressions of (£)) we 
obtain that ip e G (£), where e = poXi is the root edge. This gives (£) 7^ 0. 

Consider any Steiner tree in M n on {po, . . . ,Pfe}. By subdividing points if 
necessary, we obtain a tree with leaves {po, ■ ■ ■ ,Pfc} and with fc — 1 Steiner points 
Xi of degree 3, some of them possibly coinciding with each other or with the pi. 
This tree is the rooted abstract Steiner tree of some parenthesization (S) of S. As 
in the "=>■" -argument, we obtain that (E) 7^ implies that o G dL(po, . . . ,po), i.e., 
L attains its minimum at (po, . . . ,Po), which is when the tree is To(E)(po, . . . ,Po), 
the star joining po to the other pi. □ 

Theorem 16 (Steiner points). Let N — {pi, . . . ,Pk} be a set of points in a 
Minkowski space M™ . Then the star joining po to each pi, i G [k] , is an SMT of N 
if and only if o G (E) for each parenthesization (£) of £ = {d(pi — Po) : i G [k]}, if 
and only if o G (£') + d(pk — Po) for each parenthesization (£') o/E' = {<9(pi— Po) : 
i G [fc- 1]}. 
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Proof. Since any parenthesization (E) is weakly equivalent to (E')ffl9(pfc— Po) 
for some parenthesization of (E'), Proposition! 1 41 gives the equivalence between the 
two conditions. We next show that the first condition is necessary and sufficient. 

=X This is similar to the proof of Theorem ll5l Consider any parenthesization 
(E) of E. Turn its associated rooted abstract Steiner tree To(E) into an abstract 
Steiner tree T(E) by contracting the root. We turn T(E) into a directed graph as 
follows. Denote the (new) edge into which the root was contracted by e, and give 
it both directions, denoting the two directed edges by e + and e~ . Give all other 
edges of T(E) a single direction away from e. This tree becomes a Steiner tree in 
M n as follows. Associate leaf i with pi for each i £ [k], and associate each of the 
k — 2 internal vertices with a variable point Xi £ M™, i £ [k — 2]. Denote this 
Steiner tree by !T(S)(xi, . . . , Xk—2)- Its length is the convex function 

L{x x , x fe _ 2 ) := £(T(E)(xi, . . . , x fc _ 2 )). 

Again note that the Steiner points a;, in T(E)(xi, . . . , Xfe_2) may coincide, and then 
the tree is a contraction of T(E). Again, L attains its minimum at (po, . . . ,po). 
Calculating the subdifferential coordinate-wise, we obtain a functional (pg £ (M n )* 
for each directed edge e of -E(T(E)) such that 

{B* if e is incident with two Steiner points, 

9{Pi — Pa) if e is incident with pi, i ^ 0, 

• ip e + = -<p e -, and 

• for each Steiner point Xi, ipg = tpp + (pg, where e is the incoming edge and 

f,g the two outgoing edges of Xi, with the convention that we ignore the 
outgoing e + or e _ if Xi is incident with e. 

Write (E) = (E + )ffl(E~), where e ± points to the subtree associated with (E ± ). Let 
I be the support of (E^), and for each i £ [k] let (pi = (pg, where e is incident with 
Xi. Again by induction on subexpressions we obtain that <p e ± = ~^2 ie j± <Pi £ (E^). 
From ip e + — —tp e - it follows that o £ (E). 

<=: Similar to the corresponding direction in the proof of Theorem If 51 □ 

By the discussion in Section 14. fl when applying Theorem If 51 (Theorem If 6f) 
there are Ofc (a&_i, respectively) parenthesizations to consider. 

Note that in the -directions of the above proofs we did not need the parts 
of Lemmas to ED depending on the separation theorem, i.e., Lemma El and the 
"C"-part of Lemma Hill These directions of Theorems If 51 and If 61 are used to obtain 
lower bounds for d(M") and s(M n ). On the other hand, to obtain upper bounds 
we need the "^"-directions, where the separation theorem is needed. 

Recall that if the norm is differentiable, then 9||x|| is a singleton whenever 
x =/= o. This drastically simplifies the conditions in Theorems If 51 and If 61 and we 
regain the following two results. 

Corollary 17 (Lawlor and Morgan j2j). Let N — {pi, . . . ,p k } be a set of 
points in a Minkowski space M™ with differentiable norm. Let (pi be the norming 
functional of pi — Po, i £ [k]. Then the star joining po to each pi, i £ [k], is an 
SMT of N if and only if 

k 

& = ° 

i=l 
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and for each subset I C [k], 



i£dr<i- 



Corollary 18 (36). Let N = {p )Pl, • • • ,Pfc} &e a set o/ points in a Min- 
kowski space M™ with differ entiable norm. Let (ft be the norming functional of 
Pi — Pq, i = 1, . . . , k. Then the star joining po to each pi, i — 1, . . . , k, is an SMT 
of N if and only if for each subset I C [k], 

E>ir<i- 

iei 

5. Using extremal set theory to prove Theorem |3] 

In order to determine d(M^) we apply Theorem^] It follows from the discus- 
sion in Section that norming functionals are described as follows. 

Lemma 19. For any non-zero x E M^, we have dx = F'(X), where X E Z n 
is the unique signed set such that F(X) is the face of Z n which has ^\\ z x * n 
relative interior. 

We thus have to determine for which families £ = {Xi : i E [k]} C Z n all 
parenthesizations (-F'(E)) of F'(S) := {F'(Xi) : i E [k]} are non-empty. To this 
end, we define a commutative, non-associative binary operation □ on Z n as follows: 
• X B 1 = 1 □ X = X for all X E ZZ, and 




Extending F' to Z n by defining = {o} C Mq, we obtain the following. 

Lemma 20. For all X, Y E Z n , 

F'(X BY) C F'{X) ffl F'(Y). 

Furthermore, if at least one of the conditions X = 1. Y = 1, X + D Y~ = or 
X- H F+ = Zio^ds, i/ien 

BY) — F'(X) ffl 

Proof. We assume that X,Y ^ 1, otherwise the lemma is trivial. 

Consider the first relation. Since this is trivial if X B Y = 0, we assume that 
X and Y are not conformal, say that X + n T~ 7^ 0. There are now two cases, 
depending on whether X~ n Y + is empty or not. 

• If X- R Y + = 0, we have to show that F'(Y+,X-) C F'(X) ffl F'(Y). 
Choose any i E X + P\Y~ . Then for any ip E F'(Y + , X~) we have </?(ei) = 
0. By setting ip — tp~ + ^ € F'PQ and X = ¥> + - \d e we 
obtain c/? = V> + YG J F'pOffl J F 1 '(r). 

• If X- n T+ ^ 0, we have to show that F'(l) C ffl F'(Y). Choose 
any i E X + n Y~ and j S X~ n T+. Then ±e 4 - G F'(X) and 
-|ei + |ej e F'(Y), giving o G F'(X) ffl F'(Y). 
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This establishes the inclusion. 

Now consider the equality. Without loss assume X + DY~ = 0. Let ip E F'(X) 
and ip S F'(Y) be such that \\(p + < 1. Then for each i S X + we have 
< ip(ei) < (p(ei) +ijj(ei) and for each i E Y~ we have > <p(ei) > ip(ei) +ip(ei). 
In particular, X + n supp _ (</3 = and F~ n supp + (<^ + -0) = 0, and 

1 > \\(p + i/?\\i 

(v(ei)+^(ei))+ J! -(V»(ci)+^(ei)) 

i£supp+ (p+V>) iGsupp- (c/3+i/>) 

> X! (V( e i)+V'(ei))+ -(<^(e l )+V'(e l )) 
= 2 + ^ + 2 + H 

> 1. 

Thus equality holds everywhere, giving Hy + V'lli = 1; supp + (<p + "0) C X + , and 
supp-((^ + ^p) ^ Y~ . Therefore, ip. + ip E F'(X+,Y~). If rn7+/ 0, then 
F~) = X BY, and we are done. Otherwise, X~ n F+ = 0, giving that X and 
y are conformal. Then F'(X BY) =9, and also F'(X) ffl F'(F) = 0, since 
and F'(Y) belong to the same facet of Z*. □ 

Theorem 21. Lei £ = {JQ : i € [fc]} C Z n \ {0}. T/ien i/ie following are 
equivalent. 

(a) Some parenthesization of F'(T,) is empty. 

(b) Some parenthesization of E (with operation □) equals 0. 

(c) There exist indices a, b,c,d E [k] with {a, b} n {c, c?} = suc/i i/iai U 

x+)n(x 6 -ux d -) = 0. ' 

Proof. The implication @ =>• (jb]) follows from Lemma since F'(X) = 
implies -X" = 0. 

l|b| =4> (|Cj|: Let (E) = 0. By the definition of □, for some subexpression 
(Si) □ (E 2 ) of (E) we have (Si) ^ ^ (E 2 ) but (E x ) □ (E 2 ) = 0. If both £ x and 
E 2 are singletons, say Ei = {Xi} and E 2 = {Xj}, then Xj and are conformal, 
i.e., Xf n Xj = = n Jr, so we may take a = b = i and c = d = j. 

If Ei is a singleton {X a } but E 2 is not, then by induction on the definition of 
□ there exist X c ,X d E E 2 such that (E 2 ) = (X^,X7) is conformal with X a , i.e., 
X+ H X^ = = X+ n X~. Thus we may set b = a. A similar argument takes care 
of the case where E 2 is a singleton. 

If Ei and E 2 are both not singletons, then again by induction on the definition 
of □ there exist X a ,X b E Ei and X c ,X d E E 2 such that (E x ) = (X+,X b ~) and 
(E 2 ) = (Jf+,X7) are conformal, giving @. 

(|Cj) =^> (jsj: It is sufficient to find a parenthesization of a subfamily of -F'(E) 
that equals the empty set. First consider the case where a, b, c, d are distinct 
indices. Without loss assume that F'(X a ) ffl F'(X b ) ^ ^ F'(X C ) ffl F'(X d ). By 
Lemma I2UI we have F'(X a ) ffl F'(X b ) = F'(X a □ X b ), hence X„ □ X b ^ 0, and 
X a □ X 6 = (X+,X 6 -). Similarly, F'(X C ) ffl = F(X+ t Xj), and again by 
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Lemma EDI 

(F'(X a ) ffl F'(X b )) ffl (F'(X C ) ffl 
= F'(X+,X-)fflF'(X+,X-) 
= F'(0) = 0. 

Similarly, 

• if a = b and c = d, then F'(X a ) ffl F'(X C ) = 0, 

• if a = 6 and c ^ d, then F'(X a ) ffl (F'(X C ) ffl F'(X d )) = 0, and 

• if a ^ 6 and c = d, then (F'(X a ) ffl F'(X 6 )) ffl F'(X C ) = 0. □ 

The above theorem together with Theorem ^| now gives the following. 

Corollary 22. Let P = {p, t : i £ [k]} be a family of points in M§ \ {o}, with 
\\Pi\\z Pi * n the relative interior of face F(Xi) of Z n . Then the star connecting P 
to o is an SMT of P U {o} if and only if there do not exist indices a, 6, c, d £ [k] 
with {a, b} n {c, d} = and (X+ U X+) n (A" 5 - U Jf7) = 0. 

Consequently, the star connecting P to o is an SMT of P L) {o} if and only if 
for any I C [k], 2 < |7| < 3, i/ie siar connecting {pt : i £ L} to o is an SMT of 
{Pi :i£l}U{o}. 

This also implies Corollary 01 The problem of determining d(M|) has now 
been reduced to a problem in extremal finite set theory. 

Theorem 23. Let {JQ : i £ [k]} be a family of signed sets from Z n such that 
there do not exist indices a, 6, c, d £ [k] with {a, 6}(~l{c, d} = and (Xj"UA r ( t)n(A r b "U 
-2^7) = 0. T/ien k < (i( n ^2W2j) with equality if and only if all Xf = 0, and either 
all\X+\ £ {L(n+1)/2J, L(n+1)/2J+1}, or all\X+\ £ { \(n+ 1)/2] - 1, r(n+l)/2]}. 

Proof. It is easily seen that the hypothesis is equivalent to the following 
statement: For all families of sets {Yi : i £ [k]} C V[n + 1] \ {0, [n + 1]} such that 
X+CYiC [n + l]\Xr, 



(6) 



there do not exist indices a 7 b,c,d £ [k] 
with {a, 6} n {c, d} = and F U Y c C Y fe n F d . 

Property © is in turn equivalent to the following three conditions: 

all Yi are distinct sets, (7) 
there do not exist distinct a,b,c £ [k] with Y a C Yj, C Y c , and (8) 
there do not exist distinct a, b,c,d£ [k] with F a U Y& C F c n Y<j. (9) 

By a well-known generalisation of Sperner's theorem due to Erdos (131 12], condi- 
tions 10 and ijHJ on their own already give the sharp upper bound k < (^„+^w 2 j) 
with equality exactly when {Yi : i £ [k]} consists of two of the largest levels in 
V\n + 1] . Because of this rigidity, it easily follows that in the case of equality all 
Xf = 0, finishing the proof. □ 

TheoremOlis now proved. We remark that conditions (Q) and © on their own 
give the same upper bound, but with an extra case of equality when n + 1 = 4; see 

eh. 
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6. A geometric formulation of Sperner's theorem 

The next lemma follows from an observation of Moore |15j . 

Lemma 24. If the unit ball o/M" contains k points on its boundary such that 
the distance between any two equals 2, then s(M n ) > k. 

From this lemma together with Lemma [21 it immediately follows that s(£% ) = 
d(i^) = 2™ and s(^) = d(^) = 2n. We apply this lemma to M™. Recall that 
a vertex of Z n equals some F(X), where X° — and X^ ^ 0. Thus X can be 
identified with the set X + C [n + 1], not equal to or + We then have for 
any distinct X+, Y+ e V[n + 1] \ {0, [n + 1]} that 



\F(X+) - F(Y~ 



1 if X+ C Y+ or Y+ C X+, 

2 if X+ % Y+ and Y+ % X+ . 



It follows that a set {F(Xi) : i G 1} of vertices of Z n are all at pairwise distance 
2 if and only if {X^ : i S J} is an antichain. It follows from Lemma [2] that 
s(M|) > ( I („+tv2| ) ' This establishes Theorem El 

By Sperner's theorem % we cannot do better: The largest set of vertices 
at pairwise distance 2 has size (i(„"^w 2 j)' We still cannot do better even if wc 
consider arbitrary points on the boundary of Z n . For any boundary point p of Z n 
its support X p := supp(p) describes the unique face F(X p ) which contains p in 
its relative interior. Two boundary points p and q are at distance 2 if and only if 
there exist parallel supporting hyperplanes at p and q with Z n in between. This 
in turn is equivalent to p and — q being contained in the same facet F(X) of Z n . 
Note that F(X) is a facet of Z n if and only if |JSC ± | = 1. Therefore, ||p - q\\ z = 2 
if and only if X+ n X q ^ and X p D X+ ^ 0. Now let P = {p t : i e 1} be a 
set of boundary points of Z n at pairwise distance 2, let X, = supp(pi), and choose 
Yi G V[n+ 1] \ {0, [n+1]} such that X+ C C [n + 1] \ ir. Then it follows that 
the Yi are all distinct sets, and form an antichain. As before we have by Sperner's 
theorem that |P| < (i^+iyaj)) with equality if and only if all pi are vertices and 
all Xf have the same cardinality, either [{n + 1)/2J or \{n + l)/2] . We have shown 
the following. 

Proposition 25. The largest number of unit vectors in at pairwise distance 

2 is (L(n+l)/2j)- ^ 

An even more general result would be the following. 

Conjecture 26. The largest size of an equilateral set in equals {u^um) ■ 
This conjecture is known to hold for n = 3 34 (and is easy for n = 2). 



7. Perturbations of £™ 

The following lemma is standard. 

Lemma 27. Let ||-|| Q and \\-\\b be any norms on R™. Then the dual of M. n = 
(R n , ||-|| a + ||-||b) is isometrically isomorphic to ((R™)* , B*+ B£), where B* and B^ 
are the unit balls of the dual norms ||-||* and \\-\\l- 
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PROOF of Theorem Let ||-|| = ||-||i + A||-||2. Bv Lemmal2"?lwe mav identify 
the dual unit ball B* with X)"=i[ — e «i £ i] + XB%. Recall that each 9||se||, x ^ o, is 
an exposed face of B*. Any exposed face of B* equals a proper non-empty face F 
of the cube u] translated by a functional <p with \\1pW2 — A, such that F 

and <£> have the same sign in the following sense: if F = ^2 ieX + e * ~ Tlii<=x- e * + 
J2iex° e i\ f° r some signed set X ^ 0, then X = supp(ip). 

We first show d(R n , ||-||) < 2n if A < 1. Let p u . . . ,p k e M", p, ^ o, and 
suppose that the star joining o to all Pi is an SMT of {0} U {pi : i 6 [k]}. Then by 
Theorem 1151 all restricted Minkowski sums of the <9||Pi|| must be non-empty. Let 
<9||pi| = Fi + ifi as above, with Xi the corresponding signed set. Suppose k > 2n. 
Since all Xi 7^ 0, it follows from the pigeon-hole principle that for some two indices 
i,j, Xi and Xj have a common element of the same sign, say 1 € X^~ n X%. Then, 
since F\ and Fi are both contained in the hyperplane {\ ■ x( e i) = 1}j it follows that 
+ 9||p2|| is contained in the hyperplane {x ■ x( e i) — 2 + <pi(ei) + ^(ei)}- 
However, B* is contained in the slab bounded by ±{x : x( e i) = 1 + A}. Since 
2 + <pi(ex) + (f2{ei) > 2 > 1 + A, we obtain 9||pi|| ffl 0||P2|| = 0, a contradiction. 
Therefore, k < 2n. 

We now prove that s(R", ||-||) > 2n for A < y/n/(y/n— 1) by showing that the 
star joining o to all ±ej, i € [n], is an SMT of {ffle^ : i € [n]}. This is trivial for 
n = 1, so we assume from now on that n > 2. We have 

n 

d\\ei\\ =Ei -^[-e^e^ + Cl + A)^. 
3=1 

By Theorem 1161 it is sufficient to prove that for each parenthesization (E) of 

E := {±E l : i G [n - 1]} U {£„} 

we have o G (E) — £„, or equivalently, £„ n (E) 7^ 0. 
Write 

(E) = (E0 ffl (E 2 ) - ((Er) + (E 2 )) n B* 

where Ei U E 2 — E, Ei n E 2 = 0, and E n 6 Si. Since E n C B*, we only have to 
prove that E n n ((Ei) + (E 2 )) 7^ 0. We now replace the operation ffl in (Ej) by ffl', 
where 

C ffl' £> := (C + £>) n [-1 - A/Vn, 1 + A/V^] n . 

Then C ffl' D C C ffl D, since [-1 - X/y/n, 1 + A/y^T C B*. Denoting the 
parenthesizations with respect to ffl' by (Ej)', it is sufficient to prove that E n n 
((Ei)' + (E 2 )') 7^ 0. Since all E t as well as [-1 - X/y/n, 1 + X/^/n\ n are Cartesian 
products, we may show this coordinatewise. For C, D C R, let 

Cffl"L> := (C + D)n[-l-X/Vn,l + X/y/n\, 

and for any family E of subsets of R™, let 7Tj(E) denote the family {-Ki(A) : A E E} 
of subsets of R, where 7Tj : R" — * R is the ith coordinate projection. We have to 
show the following: 

l + Ae<7r„(Ei))" + <7r n (E 2 ))", (10) 

and for all i € [rc — 1], 

[-i,i]n( 7 r l (E))'V0, (11) 

where the parenthesizations are with respect to ffl". First note the following. 
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Claim 1. Any parenthesization with respect to ffl" of one or more sets all equal 
to [—1,1] contains [—1,1]. 

It follows from Claim ^ an d induction that 

Claim 2. Any parenthesization with respect to ffl" of two or more sets all but 
one equal to [—1, 1], and the remaining set equal to {1 + A}, contains [A, 1 + X/y/n\. 

The interval [A, 1 + X/^/n] is non-empty since A < \fnj (^/n — 1) by hypothesis. 

Since 7T„(Ei) consists of [—1, l]'s (perhaps none) and a {1 + A}, we obtain from 
ClaimElthat either (7T„(£i))" D [A, 1 + X/y/n\ or (tt„(Ei))" = {1 + A}. Similarly, 
since 7r n (E2) consists only of [—1, l]'s, we have (7r„(E2))" 3 [—1, 1] by Claim^ an d 
(HHJ follows. 

Let i E [n — 1]. Then 7ri(E) consists of a {1 + A}, a { — 1 — A}, and 2n — 3 
[— 1, l]'s. The parenthesization (^(E))" has a unique subexpression (7Ti(E ))" = 
(^(Si))" ffl" (TTi(E 2 ))" such that {1 + A} e 7Tj (Ei ) and {-1 — A} e TT^Ea). By 
Claim^l either (tt^Ei))" D [A, 1 + \/y/n\ or (tt^Ei))" = {1 + A}. Similarly, either 
(7Ti(£ 2 ))" 3 [-1 - A/Vn,-A] or (7r 2 (E 2 ))" = {-1 - A}. This gives four cases, 
namely 

• (7Ti(£ ))" 12 [-1 + A - X/yfii, 1 - A + X/y/n\, 

• (Tri(Eo))" 3 [-1,-A + A/V^], 

• (^(E )>" 2 [A-A/v^,l], 

• <7Ti(£o)>" 3 {0}. 

In all four cases it follows that (^(Eo))" n [—1, 1] ^ (the first three cases because 
A < 1/(1 — /y/n)), and we obtain (flip from the following claim, which is proved 
similarly to Claim [2 

Claim 3. Any parenthesization with respect to ffl" of two or more sets all 
but one equal to [—1, 1], and the remaining set having non-empty intersection with 
[—1,1], has non-empty intersection with [—1,1]. □ 
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